Multi-channel speaker verification based on total variability modelling
Abstract
In this work we address the speaker verification task in domestic environments monitored by multiple distributed microphones. In particular, we focus on the mismatch in the propagation channel between the enrolment stage, which occurs at a fixed position, and the test phase, which can take place anywhere in a multi-room apartment. Building upon the Total Variability framework and cosine distance scoring, we present two multi-channel solutions: one based on multi-condition training and the other based on several channel-dependent systems. An experimental analysis on a multi-channel, multi-room reverberant dataset shows that the proposed solutions are robust to changes in speaker position and orientation and improve on the performance of the single-channel matched baselines.
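To make the scoring step concrete, here is a minimal sketch of cosine scoring between an enrolment i-vector and a test i-vector, followed by one possible way to combine several channel-dependent systems. The abstract does not say how the channel-dependent scores are combined, so the averaging rule, the function names (`cosine_score`, `fuse_channel_scores`, `accept`), the microphone labels, and the threshold value are all assumptions made for illustration and not the authors' method.

```python
import math


def cosine_score(enrol_ivector, test_ivector):
    """Cosine similarity between two i-vectors given as equal-length sequences."""
    if len(enrol_ivector) != len(test_ivector):
        raise ValueError("i-vectors must have the same dimension")
    dot = sum(a * b for a, b in zip(enrol_ivector, test_ivector))
    norm_enrol = math.sqrt(sum(a * a for a in enrol_ivector))
    norm_test = math.sqrt(sum(b * b for b in test_ivector))
    if norm_enrol == 0.0 or norm_test == 0.0:
        raise ValueError("i-vectors must be non-zero")
    return dot / (norm_enrol * norm_test)


def fuse_channel_scores(enrol_by_channel, test_by_channel):
    """Average the per-channel cosine scores over channels present in both inputs.

    Hypothetical fusion rule: plain averaging is an assumption, chosen only
    because it is the simplest way to combine channel-dependent systems.
    """
    shared = sorted(set(enrol_by_channel) & set(test_by_channel))
    if not shared:
        raise ValueError("no shared channels between enrolment and test")
    scores = [cosine_score(enrol_by_channel[c], test_by_channel[c]) for c in shared]
    return sum(scores) / len(scores)


def accept(score, threshold=0.5):
    """Accept the claimed identity when the score reaches the threshold.

    The default threshold is a placeholder; in practice it would be tuned
    on development data.
    """
    return score >= threshold


# Toy two-dimensional i-vectors for two hypothetical microphones.
enrolment = {"mic_kitchen": [1.0, 0.0], "mic_living": [0.0, 1.0]}
test_utt = {"mic_kitchen": [1.0, 0.0], "mic_living": [1.0, 0.0]}
fused = fuse_channel_scores(enrolment, test_utt)
decision = accept(fused)
```

Because the cosine score depends only on the angle between the two vectors, it is unaffected by their magnitudes, which is one reason it is a common scoring choice for i-vectors.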
Similar resources
Multi-channel i-vector combination for robust speaker verification in multi-room domestic environments
In this work we address the speaker verification task in domestic environments where multiple rooms are monitored by a set of distributed microphones. In particular, we focus on the mismatch between the training of the total variability feature extraction hyper-parameters, the enrolment stage, which occurs at a fixed position in the home, and the test phase which could happen in any location of...
Quantitative influence of speech variability factors for automatic speaker verification in forensic tasks
Regarding speaker identity in forensic conditions, several factors of variability must be taken into account, such as peculiar intra-speaker variability, forced intra-speaker variability, or channel-dependent external influences. Using the large ‘AHUMADA’ speech database in Spanish, which contains several recording sessions and channels and includes different tasks for 100 male speakers, automatic speaker ...
Speaker verification based on broad phonetic categories
In this work we present a speaker verification system based on 4 broad phonetic categories: vowels+diphthongs, fricatives, glides+nasals, and silence+stops. Using these categories separately, it is observed that vowels, diphthongs, and fricatives are the most important categories for speaker verification. This observation confirms the results from the analysis of speaker and channel variability...
Factor analysis based channel compensation in speaker verification
This report describes a powerful channel compensation method for the text-independent speaker verification task. This powerful method is developed in the LRDE Speaker Verification framework. The purpose of a text-independent speaker verification system is to check whether a hypothesised speaker is really the author of a speech utterance. The channel compensation problem arises when training dat...
Feature-based and channel-based analyses of intrinsic variability in speaker verification
We explore how intrinsic variations (those associated with the speaker rather than the recording environment) affect text-independent speaker verification performance. In a previous paper we introduced the SRI-FRTIV corpus and provided speaker verification results using a Gaussian mixture model (GMM) system on telephone-channel speech. In this paper we explore the use of other speaker verificati...
Publication date: 2015